Précis of statistical significance: rationale, validity, and utility.

Author

  • S L Chow
Abstract

The null-hypothesis significance-test procedure (NHSTP) is defended in the context of the theory-corroboration experiment, as well as the following contrasts: (a) substantive hypotheses versus statistical hypotheses, (b) theory corroboration versus statistical hypothesis testing, (c) theoretical inference versus statistical decision, (d) experiments versus nonexperimental studies, and (e) theory corroboration versus treatment assessment. The null hypothesis can be true because it is the hypothesis that errors are randomly distributed in data. Moreover, the null hypothesis is never used as a categorical proposition. Statistical significance means only that chance influences can be excluded as an explanation of data; it does not identify the nonchance factor responsible. The experimental conclusion is drawn with the inductive principle underlying the experimental design. A chain of deductive arguments gives rise to the theoretical conclusion via the experimental conclusion. The anomalous relationship between statistical significance and the effect size often used to criticize NHSTP is more apparent than real. The absolute size of the effect is not an index of evidential support for the substantive hypothesis. Nor is the effect size, by itself, informative as to the practical importance of the research result. Being a conditional probability, statistical power cannot be the a priori probability of statistical significance. The validity of statistical power is debatable because statistical significance is determined with a single sampling distribution of the test statistic based on H0, whereas it takes two distributions to represent statistical power or effect size. Sample size should not be determined in the mechanical manner envisaged in power analysis. It is inappropriate to criticize NHSTP for nonstatistical reasons. At the same time, neither effect size, nor confidence interval estimate, nor posterior probability can be used to exclude chance as an explanation of data. Neither can any of them fulfill the nonstatistical functions expected of them by critics.
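
The abstract's points about one versus two sampling distributions, and about the same effect size being significant or not depending on sample size, can be made concrete with a small sketch. This is not taken from the article. It assumes a one-sided, one-sample z-test with known unit variance, and the names `critical_z`, `p_value`, and `power` are illustrative. It shows only the conventional textbook calculation under discussion, not an endorsement of it.

```python
from math import sqrt
from statistics import NormalDist

# Sampling distribution of the z statistic when H0 is true (assumed setup:
# one-sample, one-sided z-test with known standard deviation of 1).
H0_DIST = NormalDist(mu=0.0, sigma=1.0)


def critical_z(alpha: float) -> float:
    """Critical value for the significance decision; uses the H0 distribution only."""
    return H0_DIST.inv_cdf(1.0 - alpha)


def p_value(d: float, n: int) -> float:
    """One-sided p-value for an observed standardized effect d with n observations."""
    z = d * sqrt(n)
    return 1.0 - H0_DIST.cdf(z)


def power(d_alt: float, n: int, alpha: float) -> float:
    """Conventional power: needs a second distribution centred on the H1 effect."""
    h1_dist = NormalDist(mu=d_alt * sqrt(n), sigma=1.0)
    return 1.0 - h1_dist.cdf(critical_z(alpha))


# Same observed effect size, different sample sizes, different decisions.
for n in (25, 100):
    p = p_value(0.2, n)
    print(f"d=0.2 n={n} p={p:.4f} significant={p < 0.05}")
```

The critical value is fixed once alpha and the H0 distribution are chosen, whereas `power` cannot be computed until a specific alternative (`d_alt`) is supplied. That asymmetry is what the abstract refers to.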


Similar resources

Mining itemset utilities from transaction databases

The rationale behind mining frequent itemsets is that only itemsets with high frequency are of interest to users. However, the practical usefulness of frequent itemsets is limited by the significance of the discovered itemsets. A frequent itemset only reflects the statistical correlation between items, and it does not reflect the semantic significance of the items. In this paper, we propose a u...


Effect of Précis Writing Instruction on the Creation of Cohesive Text by Iranian High School EFL Learners

Becoming expert at establishing cohesion and coherence in writing is not an easy task. EFL learners must pass through long, uneven paths, such as précis exercises, to achieve this skill. The present study was launched to explore the effect of précis writing on the creation of a compact text. To this end, a true-experimental method of research with the pretest-posttest control design was em...


Content Evaluation of the Pre-marriage Education Program Provided by the State Welfare Organization of Iran: The Perspective of Marriage Experts and Educators

Pegah Goodarzi (1), Shahram Vaziri (2, *), Saeed Akbari Zardkhaneh (3). (1) Ph.D. Student, Department of Health Psychology, Karaj Branch, Islamic Azad University, Karaj, Iran. (2) Associate Professor of Clinical Psychology, Department of Clinical Psycho...


The Other Half of the Story: Effect Size Analysis in Quantitative Research

Statistical significance testing is the cornerstone of quantitative research, but studies that fail to report measures of effect size are potentially missing a robust part of the analysis. We provide a rationale for why effect size measures should be included in quantitative discipline-based education research. Examples from both biological and educational research demonstrate the utility of ef...


Validity of the Iranian Version of Health Utility Index Mark 3 Quality of Life Questionnaire

Background: The aim of this study was to standardize and develop the health utility index III (HUI3) quality of life questionnaire. This study was conducted for the first time in Iran. Method: The forward-backward translation method was applied to translate the Canadian version into Persian. The final version was developed after modifica...



Journal:
  • The Behavioral and brain sciences

Volume 21, Issue 2

Pages: -

Publication date: 1998